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SYNTHETIC INHI BITORS OF HEPATITIS C VIRUS NS3 PROTEASE 

5 

BACKGROUND OF THE INVENTION 

1 0 Hepatitis C virus (HCV) is considered to be the major etiological 

agent of non-A non-B (NANB) hepatitis, chronic liver disease, and 
hepatocellular carcinoma (HCC) around the world. The viral infection 
accounts for greater than 90% of transfusion -associated hepatitis in U.S. 
and it is the predominant form of hepatitis in adults over 40 years of 

1 5 age. Almost all of the infections result in chronic hepatitis and nearly 
20% develop liver cirrhosis. 

The virus particle has not been identified due to the lack of an 
efficient in vitro replication system and the extremely low amount of 

20 HCV particles in infected liver tissues or blood. However, molecular 
cloning of the viral genome has been accomplished by isolating the 
messenger RNA (mRNA) from the serum of infected chimpanzees then 
cloned using recombinant methodologies. [Grakoui A. et al /. Virol 67: 
1385 - 1395 (1993)] It is now known that HCV contains a positive strand 

25 RNA genome comprising approximately 9400 nucleotides, whose 
organization is similar to that of flaviviruses and pestiviruses. The 
genome of HCV, like that of flavi- and pestiviruses, encodes a single 
large polyprotein of about 3000 amino acids which undergoes proteolysis 
to form mature viral proteins in infected cells. 

30 

Cell-free translation of the viral polyprotein and cell culture 
expression studies have established that the HCV polyprotein is 
processed by cellular and viral proteases to produce the putative 
structural and nonstructural (NS) proteins. At least nine mature viral 
35 proteins are produced from the polyprotein by specific proteolysis. The 
order and nomenclature of the cleavage products are as follows: 
NH 2 -C-El-E2-NS2-NS3-NS4A-NS4B-NS5A-NS5B-COOH. The three 
amino terminal putative structural proteins, C (capsid), El, and E2 ( two 
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envelope glycoproteins), are believed to be cleaved by host signal 
peptidases of the endoplasmic reticulum(ER) . The host enzyme is also 
responsible for generating the amino terminus of NS2 . The proteolytic 
processing of the nonstructural proteins are carried out by the viral 
5 proteases: NS2-3 and NS3, contained within the viral polyprotein. The 
NS2-3 protease catalyzes the cleavage between NS2 and NS3. It is a 
metalloprotease and requires both NS2 and the protease domain of NS3. 
The NS3 protease catalyzes the rest of the cleavages of the substrates in 
the nonstructural part of the polyprotein. The NS3 protein contains 631 

1 0 amino acid residues and is comprised of two enzymatic domains: the 
protease domain contained within amino acid residues 1-181 and a 
helicase ATPase domain contained within the rest of the protein. It is 
not known if the 70 kD NS3 protein is cleaved further in infected cells to 
separate the protease domain from the helicase domain, however, no 

1 5 cleavage has been observed in cell culture expression studies. 

The NS3 protease is a member of the serine proteinase class of 
enzymes. It contains His, Asp, and Ser as the catalytic triad. Mutation of 
the catalytic triad residues abolishes the cleavages at substrates NS3/4A, 
20 NS4A/4B, NS4B/5A, and NS5A/5B. The cleavage between NS3 and 
NS4A is mediated through an intramolecular enzymatic reaction, 
whereas the cleavages at NS4A/4B, 4B/5A, 5A/5B sites occur in a trans 
enzymatic reaction. 



V 



25 Experiments using transient expression of various forms of HCV 

NS polyproteins in mammalian cells have established that the NS3 
serine protease is necessary but not sufficient for efficient processing of 
all these cleavages. Like flaviviruses, the HCV NS3 protease also 
requires a cofactor to catalyze some of these cleavage reactions. In 

30 addition to the serine protease NS3, the NS4A protein is absolutely 

required for the cleavage of the substrate at the NS3/4A and 4B/5A sites 
and increases the efficiency of cleavage of the substrate between 5A/5B, 
and possibly 4A/4B. 

35 Because the HCV NS3 protease cleaves the non-structural HCV 

proteins which are necessary for the HCV replication, the NS3 protease 
can be a target for the development of therapeutic agents against the 
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HCV virus. Thus there is a need for the development of inhibitors of 
the HCV protease. 



SUMMARY OF THE INVENTION 

5 

The present invention fills this need by providing for a bivalent 
inhibitor of an hepatitis C NS3 protease comprised of a first peptide 
linked to a second peptide, said first peptide being a subsequence, 
mutated subsequence or a mutated full-length sequence of a substrate of 
1 0 the hepatitis C NS3 protease and said second peptide being a 

subsequence, a mutated subsequence or a mutated full-length sequence 
of a hepatitis C NS4A polypeptide. 



The present application further provides for an inhibitor of an 
1 5 HCV protease comprised of a peptide, said peptide being a subsequence, 
a mutated sutjpequence, or a mutated full-length sequence of a substrate 
of the HCV NS3 protease. 

The present application further provides for an inhibitor of an 
20 HCV NS3 protease comprised of a peptide, said peptide being a 

subsequence, a mutated subsequence or a mutated full-length sequence 
of an HCV NS4A polypeptide. 



The present invention further comprises a method for treating an 
25 individual infected with the HCV virus comprising administering an 
inhibitor of an HCV NS3 protease to said individual, said inhibitor 
being comprised of a first peptide linked to a second peptide, said first 
peptide being a subsequence, a mutated subsequence or a mutated full- 
length sequence of a substrate of the hepatitis C NS3 protease and said 
30 second peptide being a subsequence, a mutated subsequence or a 
mutated full-length sequence of a hepatitis C NS4A polypeptide. 

The present invention further comprises a method for treating an 
individual infected with the HCV virus comprising administering an 
35 inhibitor of an HCV NS3 protease to said individual, said inhibitor 
being comprised of a peptide, said peptide being a subsequence, a 
mutated subsequence or a mutated full-length sequence of a substrate of 
the HCV NS3 protease. 
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The present invention further comprises a method for treating an 
individual infected with the HCV virus comprising administering an 
inhibitor of an HCV NS3 protease to said individual, said inhibitor 
5 being comprised of a peptide, said peptide being a subsequence, a 
mutated subsequence or a mutated full-length sequence of an HCV 
NS4A polypeptide. 

The present invention further comprises a pharmaceutical 
1 0 composition for treating an individual infected with hepatitis C virus, 
said pharmaceutical composition being an inhibitor of an HCV NS3 
protease, said inhibitor being comprised of a first peptide linked to a 
second peptide, said first peptide being a subsequence, a mutated 
subsequence or a mutated full-length sequence of a substrate of the 
1 5 hepatitis C NS3 protease and said second peptide being a subsequence, a 
mutated subsequence or a mutated full-length sequence of a hepatitis C 
NS4A polypeptide, and a pharmaceutical carrier. 

The present invention further provides for a pharmaceutical 
20 composition for treating an individual infected with hepatitis C virus, 
said pharmaceutical composition being comprised of an inhibitor of an 
HCV NS3 protease and a pharmaceutical carrier, said inhibitor being a 
subsequence, a mutated subsequence or a mutated full-length sequence 
of a substrate of the HCV NS3 protease. 

25 

The present invention further provides for a pharmaceutical 
composition for treating an individual infected with hepatitis C virus, 
said pharmaceutical composition being comprised of an inhibitor of an 
HCV NS3 protease and a pharmaceutical carrier, wherein said inhibitor 
30 is comprised of a peptide, said peptide being a subsequence, a mutated 
subsequence or a mutated full-length subsequence of an HCV NS4A 
polypeptide. 



35 
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BRIEF DESCRIPTION OF THF. FIKI JRRS 

Figure 1 schematically depicts an embodiment of a bivalent inhibitor of 
the present invention. 

5 

Figure 2 depicts the recombinant synthesis of plasmid pBJ1015. 

Figure 3 depicts the recombinant synthesis of plasmid pTS56-9. 

10 Figure 4 depicts the recombinant synthesis of plasmid pJB1006. 

Figure 5 depicts the recombinant synthesis of plasmid pBJ1022. 

Figure 6 depicts the recombinant synthesis of plasmid 
15 pNB(-V)182A4AHT. 

Figure 7 depicts the recombinant synthesis of plasmid pT5His/HIV/183. 

DETAILED DESCRIPTION OF THE INVENTION 

20 

The teachings of all references cited are incorporated herein in 
their entirety by reference. 

The present invention are inhibitors of the HCV NS3 protease. 

25 The present invention relates to inhibitors of the HCV NS3 protease 

which inhibit either the interaction of a substrate or cofactor NS4A with 
the NS3 protease or a bivalent inhibitor which inhibits the interaction 
of the NS3 protease with both cofactor NS4A and a substrate of the NS3 
protease. Compared to inhibitors targeting only at a single binding site, 

30 bivalent enzyme inhibitors may provide additional advantages in terms 
of higher binding affinity (potency), as well as enhanced specificity 
against similar cellular host enzymes for reduced toxicity effects. 

Design Strateev of Bivalent Inhibitors of HCV NS3 Protease 

35 

The basic strategy for the design of bivalent inhibitors of HCV 
NS3 protease involved the devise of a molecular framework consisting 
of three individual components: 
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1. a region appropriate for binding to a substrate binding site; 

2. a region suitable for binding to the NS4A binding site; 

3. a flexible linker region connecting regions (1) and (2) which 
5 would allow the two end regions to bind to their respective binding 

sites. 

Schematically, this is represented by Figure 1 in which the 
substrate subsequence is depicted as block, 10, being attached to linker 12, 
1 0 and said linker 12 being attached to the polypeptide NS4A designated 14. 

Since the NS3 protease cleaves the HCV polyprotein at the 
NS3/4A, 4A/4B, 4B/5A and 5A/5B junctions, then subsequences of or 
mutated subsequences of these sites can be used as substrate inhibitors. 

1 5 A substrate inhibitor which is a subsequence of the inhibitor should be a 
subsequence which is prior to or after the cleavage site but preferably 
should not contain the cleavage site. A mutated subsequence or mutated 
full-length sequence of the substrate can be used if the cleavage site is 
mutated so that the cleavage of the substrate does not occur cleavage 

20 leads to mechanism-based inactivation of the protease. 

For example, the NS3/4A cleavage site contains the following 
sequence: 

Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu 

5 10 15 

Val Gly Gly Val Leu (SEQ. ID NO.: 26) 

20 

The cleavage site is between the threonine at position 10 and the serine 
30 at position 11. Any subsequence inhibitor should preferably be before the 
serine or after the threonine residue. Alternatively, a mutated 
subsequence or sequence can be produced by changing the 
threonine/serine cleavage site at position 10-11 to eliminate the 
cleavage site. 



25 



WO 97/43310 



PCTAJS97/07632 



-7- 

NS4A/4B contains the following sequence. 

Tyr Gin Glu Phe Asp Glu Met Glu Glu Cys Ser Gin His Leu Pro 

5 10 15 

5 Tyr lie Glu Gin Gly (SEQ ID NO.: 27). 

20 

The cleavage site is between the cysteine residue at position 10 
and the serine at position 11. Any subsequence should preferably be 
1 0 before the serine or after the cysteine, but should preferably not contain 
both the cysteine and the serine. Alternatively, a mutated subsequence 
or sequence can be produced by changing the cysteine/ serine cleavage 
site at position 10-11 to eliminate the cleavage site. 

15 NS4B/5A contains the following sequence. 

Trp He Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu 

5 10 15 

Arg Asp He Trp Asp (SEQ ID NO.: 28) 

20 20 

The cleavage site is between the cysteine at position 10 and serine at 
position 11. Any subsequence should preferably end before the serine or 
start after the cysteine but should preferably not contain both the serine 
25 and the cysteine. Alternatively, a mutated subsequence or sequence can 
be produced by changing the cysteine /serine cleavage sit at position 10 - 
11 to eliminate the cleavage site. 

NS5A/5B contains the following sequence. 

30 Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Thr Trp Thr 

5 10 15 

Gly (SEQ. ID NO.: 25) 



The cleavage site is between the cysteine at position 8 and the serine at 
35 position 9. Any subsequence should preferably end at the cysteine or 
start at the serine, but should preferably not contain both the cysteine 
and the serine. Alternatively, a mutated sequence or subsequence can be 
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produced by changing the cysteine /serine cleavage site at position 8 - 9 to 
eliminate the cleavage site. 



Linker 12 can be any chemical entity that can form a bond with 
polypeptides 10 and 14. Preferably the linker should be equivalent in 
length to a carbon chain having about 7-14 carbon residues. Examples of 
suitable linkers are two 6-aminocaproic acid (Acp) residues or an Acp 
and Lys wherein one of the polypeptides 10 or 14 form a peptide bond 
with the e amine of lysine. 

Examples of bivalent inhibitors of the present invention are the 
following: 

Glu-Asp-Val-Val-Cys-Cys-Acp-Acp-Cys-Val-Val-Ile-Val- 
Gly-Arg-Ile-Val-Leu-Ser-Gly-Lys (SEQIDNO: 1) 

Glu-Asp- Val -Val -Cys -Cys - Acp-Cys -Val -Val -Ile-Val - 
Gly-Arg-Ile-Val-Leu-Ser-Gly-Lys-Lys (SEQ ID NO:2) 

Glu-Asp-Val-Val-Cys-Cys-Acp-Xaa-Lys-Gly-Ser-Leu-Val- 
Ile-Arg-Gly-Val-Ile-Val-Val-Cys (SEQ ID NO: 3) 

wherein Xaa is a lysine residue having a peptide bond between its 
E-amino and the carboxyl group of the following lysine which forms a 
peptide bond with the glycine at position 10. Furthermore, the glutamic 
acid residue at position 1 may or may not be acetylated. 

Glu-Asp-Val-Val-Cys-Cys-Xaa-Lys-Gly-Ser-Leu-Val- 
Ile-Arg-Gly-Val-Ile-Val-Val-Cys (SEQ ID NO: 4) 

wherein Xaa is Lysine having a peptide bond between its e-amino 
and the carboxyl group of the following lysine which forms a peptide 
bond with the Gly; furthermore, the carboxyl group of the Xaa forms a 
peptide bond with the a-amino group of another lysine (not shown); 



Glu-Asp-Val-Val-Cys-Cys-Acp-Acp-Lys-Gly-Ser-Leu-Val- 
Ile-Arg-Gly-Val-ile-Val-Val-Cys (SEQ ID NO: 5) 
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wherein the amino acids at positions 9-21 are preferably D-amino 

acids; 

5 Glu -Asp- Val -Val-Cys-Cys -Acp-Ly s -Cys -Va 1 - Val - lie - 

Val-Gly-Arg-Ile-Val-Leu-Ser-Gly-Lys (SEQ ID NO: 6) 

wherein the lysine residue at position 8 has a peptide bond 
between the carboxyl of Acp and the a amino group of the lysine, and 
10 the e amino group of the lysine at position 8 forms a peptide bond with 
the carboxyl group of the cysteine residue at position 9 and the amino 
acid residues at positions 9-21 are preferably D-amino acid residues; 

Glu-Asp-Val-Val-Cys-Cys-Acp-Lys-Gly-Ser-Leu-Val-Ile-Arg- 
1 5 Gly-Val-Ile-Val-Val-Cys-Lys (SEQ ID NO: 7) 

wherein amino acid residues at positions 8-20 are preferably D- 
amino acid residues; 

20 Glu-Asp-Val -Val-Cys-Cys -Xaa-Cys-Val-Val-Ile-Val-Gly- 

Arg-Ile-Val-Leu-Ser-Gly-Lys (SEQ ID NO: 8) 

wherein Xaa is a Lys which forms a peptide bond between its e- 
amino acid and the carboxyl group of the Cys residue at position 8 and 
25 the carboxyl group of the Lys residue forms a peptide bond with an alpha 
amino group of another Lys residue (not shown), preferably the amino 
acid residues at positions 8-20 are D- amino acids. 

Examples of suitable monovalent inhibitors of the present 
30 invention are the following: 

Lys-Gly-Ser-Leu-VaHle-Arg-Gly-Val-Ile-Val-Val-Cys-Lys 
(SEQ ID NO.: 9) 

wherein the amino acid residues at positions 1- 13 are preferably 
35 D-amino acid residues; 

Lys-Gly-Ser-Leu-Val-Ile-Arg-Gly-Val-De-Val-Lys (SEQ ID NO.: 10) 
wherein amino acid residues at positions 1-11 are preferably 
D-amino acid residues; 



WO 97/43310 



PCT/US97/07632 



- 10- 

Cys-Val-VaMle-Val-Gly-Arg-Ile-Val-Leu-Ser-Gly-Lys 
(SEQIDNO.:ll) 

5 wherein the amino acid residues are preferably D-amino acid 

residues; 

Ser-Uu-Val-Ile-Arg-GIy-Val-Ile-Val (SEQ ID NO.: 12) 

0 wherein the amino acid residues are preferably D-amino acid 

residues and the serine residue at position 1 has been preferably 
acetylated; 

Lys-Gly-Ser-Leu-Val-Ile-Arg-Gly-Val-Ile-Val-Val-Cys 
5 (SEQ ID NO.: 13) 

wherein the amino acid residues are preferably D-amino acid 
residues the lysine residue at position 1 is preferably acetylated; 

Xaa-Lys-Gly-Ser-Leu-VaUle-Arg-Gly-Val-Ile Val-Val-Cys-Lys-Lys 
(SEQ ID NO.: 14); 

wherein Xaa is biotin and the amino acid residues at positions 2 - 
14 are preferably D-amino acid residues; 

Lys-Gly-Ser-Leu-Val-Ile-Arg-Gly-VaHle-Val-Val-Cys-Xaa-Lys 
(SEQ ID NO.: 15); 

Xaa is a lysine residue in which the e amino group of the lysine 
forms a peptide bond with a biotin, and amino acid residues at positions 
1-13 are preferably D-amino acid residues. 

The inhibitors of the present invention can be synthesized by a 
suitable method such as by exclusive solid phase synthesis, partial solid 
phase methods, fragment condensation or classical solution synthesis. 
The polypeptides are preferably prepared by solid phase peptide 
synthesis as described by Merrifield, J. Am. Chem. Soc. 55:2149 (1963). 
The synthesis is carried out with amino acids that are protected at the 
alpha-amino terminus. Trifunctional amino acids with labile side- 
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chains are also protected with suitable groups to prevent undesired 
chemical reactions from occurring during the assembly of the 
polypeptides. The alpha-amino protecting group is selectively removed 
to allow subsequent reaction to take place at the ammo-terminus. The 
5 conditions for the removal of the alpha-amino protecting group do not 
remove the side-chain protecting groups. 

The alpha-amino protecting groups are those known to 
be useful in the art of stepwise polypeptide synthesis. Included are 

1 0 acyl type protecting groups (e.g., formyl, trifluoroacetyl, acetyl), aryl 
type protecting groups (e.g. , biotinyl), aromatic urethane type 
protecting groups [e.g., benzyloxycarbonyl (Cbz), substituted 
benzyloxycarbonyl and 9-fluorenylmethyloxy-carbonyl (Fmoc)J, 
aliphatic urethane protecting groups [e.g., t-butyloxycarbonyl (tBoc), 

1 5 isopropyloxycarbonyl, cyclohexyloxycarbonyl] and alkyl type 

protecting groups (e.g., benzyl, triphenylmethyl). The preferred 
protecting groups are tBoc and Fmoc, thus the peptides are said to be 
synthesized by tBoc and Fmoc chemistry, respectively. 

20 The side-chain protecting groups selected must remain 

intact during coupling and not be removed during the deprotection 
of the amino-terminus protecting group or during coupling 
conditions. The side-chain protecting groups must also be 
removable upon the completion of synthesis, using reaction 

25 conditions that will not alter the finished polypeptide. In tBoc 

chemistry, the side-chain protecting groups for trifunctional amino 
acids are mostly benzyl based. In Fmoc chemistry, they are mostly 
tert.-butyl or trityl based. 

30 In tBoc chemistry, the preferred side-chain protecting groups 

are tosyl for Arg, cyclohexyl for Asp, 4-methylbenzyl (and 
acetamidomethyl) for Cys, benzyl for Glu, Ser and Thr, 
benzyloxymethyl (and dinitrophenyl) for His, 2-Cl-benzyloxycarbonyl 
for Lys, formyl for Trp and 2-bromobenzyl for Tyr. In Fmoc 

35 chemistry, the preferred side-chain protecting groups are 2,2,5,7,8- 
pentamethylchroman-6-sulfonyl (Pmc) or 2,2,4,6,7- 
pentamethyldihydrobenzofuran-5-sulfonyl (Pbf) for Arg, trityl for 
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Asn, Cys, Gin and His, tert-butyl for Asp, Glu, Ser, Thr and Tyr, tBoc 
for Lys and Trp. 



Solid phase synthesis is usually carried out from the carboxyl- 
terminus by coupling the alpha-amino protected (side-chain 
protected) amino acid to a suitable solid support. An ester linkage is 
formed when the attachment is made to a chloromethyl, chlortrityl 
or hydroxymethyl resin, and the resulting polypeptide will have a 
free carboxyl group at the C-terminus. Alternatively, when an amide 
resin such as benzhydrylamine or p-methylbenzhydrylamine resin 
(for tBoc chemistry) and Rink amide or PAL resin (for Fmoc 
chemistry) is used, an amide bond is formed and the resulting 
polypeptide will have a carboxamide group at the C-terminus. These 
resins, whether polystyrene- or polyamide-based or 
polyethyleneglycol-grafted, with or without a handle or linker, with 
or without the first amino acid attached, are commercially available, 
and their preparations have been described by Stewart et al (1984)., 
"Solid Phase Peptide Synthesis" (2nd Edition), Pierce Chemical Co., 
Rockford, IL.; and Bayer & Rapp (1986) Chem. Pept. Prot. 3, 3; and 
Atherton, et al. (1989) Solid Phase Peptide Synthesis: A Practical 
Approach, IRL Press, Oxford. 

The C-terminal amino acid, protected at the side-chain if 
necessary and at the alpha-amino group, is attached to a 
hydroxylmethyl resin using various activating agents including 
dicyclohexylcarbodiimide (DCC), N,N'-diisopropylcarbodiimide 
DIPCDI) and carbonyldiimidazole (CDI). It can be attached to 
chloromethyl or chlorotrityl resin directly in its cesium 
tetramethylammonium salt form or in the presence of triethylamine 
(TEA) or diisopropylethylamine (DIEA). First amino acid 
attachment to an amide resin is the same as amide bond formation 
during coupling reactions 

Following the attachment to the resin support, the alpha- 
amino protecting group is removed using various reagents 
depending on the protecting chemistry (e.g. , tBoc, Fmoc). The extent 
of Fmoc removal can be monitored at 300-320 nm or by a 
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conductivity cell. After removal of the alpha-amino protecting 
group, the remaining protected amino acids are coupled stepwise in 
the required order to obtain the desired sequence. 

5 Various activating agents can be used for the coupling 

reactions including DCC, DIPCDI, 2<hloro-l,3-dimethylimidium 
hexafluorophosphate (CIP), benzotriazol-l-yl-oxy-tris- 
(dimethylamino)-phosphonium hexafluorophosphate (BOP) and its 
pyrrolidine analog (PyBOP), bromo-tris-pyrrolidino-phosphonium 

1 0 hexafluorophosphate (PyBroP), N- [(lH-benzotriazol-l-yl) - 
(dimethylamino) methylene] -N-methylmethanaminium 
hexaflourophosphate N-oxide (HBTU) and its tetrafluoroborate 
analog (TBTU) or its pyrrolidine analog (HBPyU), (HATU) and its 
tetrafluoroborate analog (TATU) or pyrrolidine analog (HAPyU). The 

1 5 most common catalytic additives used in coupling reactions include 
4-dimethylaminopyridine (DMAP), 3-hydroxy-3,4-dihydro-4-oxo- 
1,2,3-benzotriazine (HODhbt), N-hydroxybenzotriazole (HOBt) and 1- 
hydroxy-7-azabenzotriazole (HOAt). Amino acid flourides or 
chlorides may be used for difficult couplings. Each protected amino 

20 acid is used in excess (>2.0 equivalents), and the couplings are usually 
carried out in N-methylpyrrolidone (NMP) or in DMF, CH2CI2 or 
mixtures thereof. The extent of completion of the coupling reaction 
can be monitored at each stage, e.g., by the ninhydrin reaction as 
described by Kaiser et al, Anal Biochem. 34:595 (1970). In cases 

25 where incomplete coupling is found, the coupling reaction is 

extended and repeated and may have chaotropic salts added. The 
coupling reactions can be performed automatically with 
commercially available instruments such as ABI model 430 A, 431 A 
and 433A peptide synthesizers. 

30 

After the entire assembly of the desired peptide, the peptide- 
resin is cleaved with a reagent with proper scavengers. The Fmoc 
peptides are usually cleaved and deprotected by TFA with scavengers 
(e.g., H2O, ethanedithiol, phenol and thioanisole). The tBoc peptides 
35 are usually cleaved and deprotected with liquid HF for 1-2 hours at -5 
to 0*C, which cleaves the polypeptide from the resin and removes 
most of the side-chain protecting groups. Scavengers such as anisole, 
dimethylsulfide and p-thiocresol are usually used with the liquid HF 
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to prevent cations formed during the cleavage from alkylating and 
acylating the amino acid residues present in the polypeptide. The 
formyl group of Trp and dinitrophenyl group of His need to be 
removed, respectively, by piperidine and thiophenol in DMF prior 
5 to the HF cleavage. The acetamidomethyl group of Cys can be 

removed by mercury(II) acetate and alternatively by iodine, thallium 
(III) trifluoroacetate or silver tetrafluoroborate which simultaneously 
oxidize cysteine to cystine. Other strong acids used for tBoc peptide 
cleavage and deprotection include trifluoromethanesulfonic acid 
1 0 (TFMSA) and trimethylsilyltrifluoroacetate (TMSOTf). 

In particular the peptides of the present invention were 
assembled from a Fmoc-Amide resin or a Fmoc-L-Lys- (tBoc) - Wang 
resin on an ABI model 433A synthesizer (Applied Biosystems, Foster 

1 5 City, CA) by solid phase peptide synthesis method as originally described 
by Merrifield, J. Am.Chem.Soc. 85:2149 (1963) but with Fmoc chemistry. 
The side chains of trifunctional amino acids were protected by tert.-butyl 
for Glu, Asp and Ser, trityl for Cys, tert.-butyloxycarbonyl (tBoc) for Lys 
and 2,2,5 / 7,8-r^ntamethylchroman-6-sulfonyl (Pmc) or 2,2,4,6,7- 

20 pentamethyldihydrobenzofuran-5-sulfonyl (Pbf) for Arg. N-a-Fmoc 
protected amino acids were pre-activated by HATU and l-hydroxy-7- 
azabenzotriazole (HOAt) prior to coupling to the resin. 
Dimethylsulfoxide (20%) was added during conditional extended 
coupling and Fmoc deprotection reactions. The synthesis of the 

25 inhibitors SEQ ID NOs: 1, 2, 5, 7, and 9-15 was accomplished by 

sequential and linear assembly of appropriate D- and L-amino acids and 
achiral amino acids (Gly and Ahx). The synthesis of the inhibitors SEQ 
ID NOs: 3, 4, 6, and 8 required orthogonal chain assembly anchored at a 
Lys residue whose side chain amino group was protected by l-(4,4- 

30 dimethyl-2,6-dioxocyclohex-l-ylidene)-ethyl (Dde). For example, for the 
preparation of the inhibitor SEQ ID NO 3, Ac-Glu-Asp-Val-Val-Cys-Cys- 
Acp-Lys-(Amide resin) (SEQ ID NO: 29) was first assembled. Then the 
Dde protecting group on the Lys residue was removed by 2% hydrazine 
in dimethylformamide (Bycroft, B.W. et al J. Chem. Soc. Chem. 

35 Commun. 1993, 778). Finally the second arm 

Cys-Val-Val-Ile-Val-Gly-Arg-Ue-Val-Leu-Ser-Gly-Lys (SEQ ID NO:30) 
was sequentially assembled from the side chain amino group. The 
assembled peptide was cleaved from the resin with simultaneous 



WO 97/433 10 PCIYUS97/07d32 

-15- 

deprotection of side chain protecting groups for three hours by 
trifluoroacetic acid (TFA) with proper scavengers (80% TFA : 4% phenol 
: 4% H2O, 4% thioanisole : 4% ethanedithiol : 4% triisopropylsilane). 
The cleaved peptide was separated from the resin by filtration and 
5 precipitated and repeatedly washed in anhydrous ethyl ether. The 

precipitated peptide was lyophilized in H 2 0 overnight. The lyophilized 
crude peptide was purified by reverse phase HPLC The purified peptide 
was further analyzed by HPLC, mass spectroscopy and amino acid 
analysis. 

10 

One can ascertain if a potential compound is effective as an 
inhibitor of the HCV NS3 protease by using a high throughput assay 
utilizing the NS3 protease, the NS4 cofactor and the peptide substrates, 
either 4B/5A or 5A/5B. These can be used to screen for compounds 

1 5 which inhibit proteolytic activity of the protease. One does this by 
developing techniques for determining whether or not a compound 
will inhibit the NS3 protease from cleaving the viral substrates. If the 
substrates are not cleaved, the virus cannot replicate. One example of 
such a high throughput assay is the scintillation proximity assay (SPA). 

20 SPA technology involves the use of beads coated with scintillant 

Bound to the beads are acceptor molecules such as antibodies, receptors 
or enzyme substrates which interact with ligands or enzymes in a 
reversible manner. 

25 For a typical SPA based protease assay the substrate peptide is 

biotinylated at one end and the other end is radiolabeled with low 
energy emitters such as 125 I or 3 H. The labeled substrate is then 
incubated with the enzyme. Avidin coated SPA beads are then added 
which bind to the biotin. When the substrate peptide is cleaved by the 

30 protease, the radioactive emitter is no longer in proximity to the 
scintillant bead and no light emission takes place. Inhibitors of the 
protease will leave the substrate intact and can be identified by the 
resulting light emission which takes place in their presence. 

35 Another example of a suitable assay technique is an HPLC assay in 

which the resultant reaction mixture containing the NS3 protease, the 
substrate products and the potential inhibitor is resolved on an HPLC 
column to determine the extent of the cleavage of the substrate. If the 
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substrate has not been cleaved or the cleavage has been inhibited, then 
only the intact substrate would be present or a reduced amount of the 
cleaved product will be shown to be present. If this is the case, then the 
compound is an effective inhibitor of the NS3 protease. 

5 

Pharmaceutica l Compnsitj j ft p R 

The dosage level of inhibitors necessary for effective 
therapy to inhibit the HCV NS3 protease will depend upon many 
1 0 different factors, including means of administration, target site, 

physiological state of the patient, and other medicants administered. 
Thus, treatment dosages should be titrated to optimize safety and 
efficacy. Typically, dosages used in vitro may provide useful guidance 
in the amounts useful for in situ administration of these reagents. 

1 5 Animal testing of effective doses for treatment of particular disorders 
will provide further predictive indication of human dosage. Various 
considerations are described, e.g., in Gilman, et al. (eds.) (1990) Goodman 
and Oilman's; The Pharmacologic! Bases of Therapeutics. 8th Ed., 
Pergamon Press; and Remington' s Pharmaceutical Sciences. 17th ed. 

20 (1990), Mack Publishing Co., Easton, Penn. Methods for administration 
are discussed therein and below, e.g., for oral, intravenous, 
intraperitoneal, or intramuscular administration, transdermal 
diffusion, and others. See also Langer (1990) Science 249:1527-1533. 
Pharmaceutical^ acceptable carriers will include water, saline, buffers, 

25 and other compounds described, e.g., in the Merck Index. Merck & Co., 
Rahway, New Jersey, lfig per kilogram weight of the patient to 500 mg 
per kilogram weight of the patient with an appropriate carrier is a range 
from which the dosage can be chosen. Slow release formulations, or a 
slow release apparatus will often be utilized for continuous 

30 administration. 

The inhibitors of the HCV NS3 protease of the present 
invention may be administered directly to the host to be treated or, 
depending on the size of the compounds, it may be desirable to 
35 conjugate them to carrier proteins such as ovalbumin or serum 

albumin prior to their administration. Therapeutic formulations may 
be administered in any conventional dosage formulation. While it is 
possible for the active ingredient to be administered alone, it is 
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preferable to present it as a pharmaceutical formulation. Formulations 
typically comprise at least one active ingredient, as defined above, 
together with one or more acceptable carriers thereof. Each carrier 
should be both pharmaceutical^ and physiologically acceptable in the 
5 sense of being compatible with the other ingredients and not injurious 
to the patient. Formulations include those suitable for oral, rectal, 
nasal, or parenteral (including subcutaneous, intramuscular, 
intravenous and intradermal) administration. The formulations may 
conveniently be presented in unit dosage form and may be prepared by 

1 0 any methods well known in the art of pharmacy. See, e.g., Gilman, et al. 
(eds.) (1990) Goodman and Gilman's : The Pharmacological Bases of 
Therapeutics. 8th Ed., Pergamon Press, Parry town, NY; Remington's 
Pharmaceutical Sciences. 17th ed. (1990), Mack Publishing Co., Easton, 
Penn.; Avis, et al. (eds.)(1993) Pharmaceutical Dosage Forms: Parenteral 

1 5 Medications 2d ed., Dekker, NY; Lieberman, et al. (eds.)(1990) 
Pharmaceutical Dosage Forms: Tablets 2d ed., Dekker, NY; and 
Lieberman, et al. (eds.)(1990) Pharmaceutical Dosage Forms: Disperse 
Systems Dekker, NY. The therapy of this invention may be combined 
with or used in association with other chemotherapeutic or 

20 chemopreventive agents. 

The following examples are included to illustrate but not to limit the 
present invention. 

Example 1 

Bivalent Inhibitors of HCV NS3 Protease 



25 



The bivalent inhibitors of defined by SEQ ID NOs.: 1-10 were 
synthetically produced as described above and tested for their ability to 
30 inhibit the HCV NS3 protease as follows. 



Into an aqueous solution containing 25 mM TRIS, 50 mM NaCl, 
.5 mM EDTA, 10% glycerol and .1% NP40 was placed the potential 
inhibitor, the HCV NS3 protease at a concentration of 0.05 \iM - 0.1 mM, 
35 the HCV NS4A cofactor at a concentration of 0.05 \iM - 0.1 \iM and the 
5A/5B substrate at a concentration of 50 jiM. This solution was then 
incubated for approximately 2 hours at 30°C after which the solution 
was applied to an HPLC to determine if the 5A/5B remained intact and 
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thus the compound was determined to be an inhibitor. However, if the 
HPLC showed that 5 A and 5B were present without the 5A/5B then the 
compound is not an inhibitor. The potential inhibitors were assayed at 
several different concentrations to determine the concentration which 
5 produced 50% inhibition of the HCV NS3 protease. The results are 
shown below. 

Inhibitor ICsoiuM) 

SEQ ID NO:l 0.6 
50571-120 

10 SEQ ID NO;2 3.0 

50962-13 

SEQ ID NO:3 3.0 
50828-001 

SEQ ID NO:4 3 - 30 
1 5 50962-22 

SEQ ID NQ5 02 
50571-144 

SEQ ID NO:6 2.0 
50571-150 

20 SEQIDNO:7 0.2 

50828-131 

SEQ ID NO:8 0.2 
50962-24 
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ExampHe 2 

Monovalent Inhibitors of the HCV NS3 Protease 

5 Examples of monovalent inhibitors of the HCV NS3 protease are 

as follows. 

Inhibitor ICsn( uM) 

10 SEQIDNO.:9 02 

50828-129 

SEQ ID NO.: 10 5 
50962-004 

SEQ ID NO.: 11 0.2 
1 5 50828-70 

SEQ ID NO.: 12 0.6 
50828-116 

SEQ ID NO.: 13 2.0 
50571-147 

20 SEQ ID NO.: 14 0.4 

50962-047 

SEQ ID NO.: 15 0.4 
50962-050 



25 



Examples 3 

30 

Production of HCV NS3 Protease 

A. Plasmid constructions. 

35 Several plasmids were designed and constructed using standard 

recombinant DNA techniques (Sambrook,Fritsch & Maniatis) to express 
the HCV protease in E. colt (Fig 2-7). All HCV specific sequences 
originated from the parental plasmid pBRTM/HCV 1-3011 (Grakoui el 
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fl/,1993). To express the N-terminal 183 amino acid versions of the 
protease, a stop codon was inserted into the HCV genome using 
synthetic oligonucleotides (Fig. 3). The plasmids designed to express the 
N-terminal 246 amino acid residues were generated by the natural Ncol 
5 restriction site at the C-terminus. 

i) Construction of the plasmid pBJl 015 (Figure 2) 

The plasmid pBRTM/HCV 1-3011 containing the entire HCV genome 

1 0 (Grakoui A., et al., /. Virol. 67: 1385-1395) was digested with the 

restriction enzymes Sea I and Hpa I and the 7138 bp (base pair) DNA 
fragment was isolated and cloned to the Sma I site of pSP72 (Promega) to 
produce the plasmid, pRJ201. The plasmid pRJ 201 was digested with 
Msc I and the 2106 bp Msc I fragment was isolated and cloned into the 

1 5 Sma I site of the plasmid pBD7. The resulting plasmid pMBM48 was 

digested with Kas I and Nco I, and the 734 bp DNA fragment after blunt 
ending with Klenow polymerase was isolated and cloned into Nco I 
digested, klenow polymerase treated pTrc HIS B seq expression plasmid 
(Invitrogen). The ligation regenerated a Nco I site at the 5* end and Nsi I 

20 site at the 3' end of HCV sequence. The plasmid pTHB HCV NS3 was 
then digested with Nco I and Nsi I, and treated with klenow polymerase 
and T4 DNA polymerase, to produce a blunt ended 738 bp DNA 
fragment which was isolated and cloned into Asp I cut, klenow 
polymerase treated expression plasmid pQE30 (HIV). The resulting 

25 plasmid pBJ 1015 expresses HCV NS3 (246 amino acids) protease. 

(ii) Construction of the plasmid pTS 56-9 with a stop codon after 
amino acid 183 (Figure 3) 

30 The plasmid pTHB HCV NS3 was digested with Nco I, treated 

with klenow polymerase, then digested with Bst Y I; and the DNA 
fragment containing HCV sequence was isolated and cloned into Sma I 
and Bgl H digested pSP72. The resulting plasmid pTS 49-27 was then 
digested with Bgl II and Hpa I and ligated with a double stranded 

35 oligonucleotide: 

GA TCA CCG GTC TAG ATCT 

t ggc cag atc taga (SEQ ID NO 18) to produce pTS 56-9. 
Thus, a stop codon was placed directly at the end of DNA encoding the 
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protease catalytic domain of the NS3 protein. This enabled the HCV 
protease to be expressed independently from the helicase domain of the 
NS3 protein. 

5 (iii) Construction of the plasmid pJB 1006 Fused with a peptide of 
positively charged amino acids at the carboxy terminus of NS3 183 
(Figure 4). 

The plasmid pTS 56-9 was digested with Sph I and Bgl II and the DNA 
1 0 fragment containing HCV sequence was isolated and cloned into a Sph I, 
Bgl II cut pSP72. The resulting plasmid pJB 1002 digested with Age I and 
Hpal and ligated to a double stranded oligonucleotide, 

CCG GTC CGG AAG AAA AAG AGA CGC TAG C 
AG GCC TTC TTT TTC TCT GCG ATC G 

1 5 (SEQ ID NO 19), to construct pJB 1006. This fused the hydrophilic, 
solubilizing motif onto the NS3 protease. 

(iv) Construction of the plasmid pBJ 1022 expressing His-NS3(183)-HT 
20 in E.coli (Figure 5) 

The plasmid pJB 1006 was digested with NgoM I and Nhe I and the 216 
bp DNA fragment was isolated and cloned into Ngo M I, Nhe I cut pBJ 
1015 to construct plasmid pBJ 1019. The plasmid pBJ 1019 was digested 

25 with Nar I and Pvu II, and treated with Klenow polymerase to fill in 5' 
ends of Nar I fragments. The expression plasmid pQE31 (Invitrogen) was 
digested with BamH I, blunt ended with Klenow polymerase. The 717 bp 
Nar I- Pvu II DNA fragment was isolated and ligated to the 2787 bp 
BamH I/Klenowed -Msc I (Bal I) fragment of the expression plasmid 

30 pQE31 (Invitrogen). The recombinant plasmid, pBJ 1022, obtained after 
transformation into E.coli expresses His NS3(2-183)-HT which does not 
contain any HIV protease cleavage site sequence. The plasmid also 
contains a large deletion in the CAT (Chloramphenicol Acetyl 
Transferase) gene. 

35 

(v) Construction of the plasmid pNB(-V)182-A4A HT (Figure 6) 
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The plasmid pMBM 48 was digested with Eag I and Xho I, treated with 
Klenow polymerase and the 320 bp DNA fragment was isolated and 
cloned into BamH I cut , blunt ended pSP 72 to construct the plasmid 
PJB1004. The 320 bp fragment encodes 7 amino acid from carboxy 
5 terminal of NS3(631), all of NS4A, and the amino terminal 46 amino 
acid of NS4B. The recombinant plasmid pJB1004 was digested with Eag I 
and Cel 2, blunt ended with Klenow polymerase. The 220 bp DNA 
fragment was isolated and cloned into the expression plasmid pQE30 
which was digested with BamH I and blunt ended with Klenow 
1 0 polymerase prior to ligation. The resulting plasmid pJB 101 1 was 
digested with NgoM I and Hind m and ligated to a double stranded 
oligonucleotide , 

CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC CAG GAA TTC 
15 GT TAA TAT GGA CTG TCC CTC CAA GAG ATG GTC CTT AAG 



20 



25 



30 



35 



GAT GAG ATG GAA GAG TGC CGG AAG AAA AAG AGA CGC A 

CTA CTC TAC CTT CTC ACG GCC TTC TTT TTC TCT GCG TTC GA 

(SEQ ID NO 20) 

to construct the plasmid pNB 4A HT. The plasmid pNB 4AHT was 
digested with Msl I and Xba I. The 1218 bp DNA fragment was isolated 
and cloned into Age I cut, klenow polymerase treated, Xba I cut vector 
DNA of pBJ 1019. The ligation results in a substitution of the 183rd 
amino acid residue valine by a glycine residue in NS3, and a deletion of 
amino terminal three amino acid residues of NS4A at the junction. The 
recombinant plasmid pNB182A4A HT comprising NS3(182aa)-G- 
NS4A(4-54 amino acid) does not contain NS3/NS4A cleavage site 
sequence at the junction and is not cleaved by the autocatalytic activity 
of NS3. Finally the plasmid pNB182A4A HT (SEQ ID NO 8) was digested 
with Stu I and Nhe I, the 803 bp DNA fragment was isolated and cloned 
into Stu I and Nhe I cut plasmid pBJ 1022. The resulting plasmid pNB(- 
V)182-A4A HT contains a deletion of the HIV sequence from the amino 
terminus end of the NS3 sequence and in the CAT gene (SEQ ID NO 23). 

(vi) Construction of the plasmid pT5 ^ HIV-NS3 fFigiirP 7) 
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The plasmid pTS56-9 was digested with Bgl II, and treated with 
Klenow polymerase to fill in 5' ends. The plasmid was then digested 
with NgoM I and the blunt ended Bgl II/NgoMI fragment containing 
the NS3 sequence was isolated and ligated to the Sgll, Klenow treated 
5 NgmMI cut and Sal I klenowed pBJ 1015. The resulting plasmid is 
designated pTSHis HIV 183. 



10 



15 



Example 4 

Purification of HCV NS3 Protease havin g a Solubilizing Motif 

Purification of His1 82HT fSEO TP NO 41 and 
His f-V)182A4AHT (SEP ID NO ffl 



The recombinant plasmids pBJ1022 and pNB(-V)182A4A were 
used to transform separate cultures of £. coli strain M15 [pREP4] 
(Qiagen), which over-expresses the lac repressor, according to methods 
recommended by the manufacturer. M15 [pREP4] bacteria harboring 

20 recombinant plasmids were grown overnight in broth containing 20g/L 
bactotrypton, lOg/L bacto-yeast extract, 5g/L NaCl (20-10-5 broth) and 
supplemented with lOOjig/ml ampicillin and 25*ig/ml kanamycin. 
Cultures were diluted down to O.D.600 of 0.1, then grown at 30°C to 
O.D.600 of 0.6 to 0.8, after which IPTG was added to a final concentration 

25 of ImM. At post-induction 2 to 3 hours, the cells were harvested by 

pelleting, and the cell pellets were washed with lOOmM Tris, pH 7.5. Cell 
lysates were prepared as follows: to each ml equivalent of pelleted 
fermentation broth was added 50*il sonication buffer (50mM sodium 
phosphate, pH 7.8, 0.3M NaCl) with Img/ml lysozyme; cell suspension 

30 was placed on ice for 30 min. Suspension was then brought to a final 
concentration of 0.2% Tween-20, lOmM dithiothreitol (DTT), and 
sonicated until cell breakage was complete. Insoluble material was 
pelleted at 12,000 x g in a microcentrifuge for 15 minutes, the soluble 
portion was removed to a separate tube and the soluble lysate was then 

35 brought to a final concentration of 10% glycerol. Soluble lysates from 

cells expressing the plasmids produce strongly immunoreactive bands of 
the predicted molecular weight. Soluble lysates prepared for Ni 2+ 
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column purification were prepared with 10mM B-mercaptoethanol 
(BME) instead of DTT. Lysates were stored at -80 °C . 



Purification HfrinP Ni^-Nitrosvl arerir aHH fMTA) af Ta rn^ (caAnvr^ 

The proteins were then purified by placing the extracted lysate on 
an NTA agarose column. NTA agarose column chromatography was 
used because the histidine tag which was fused to the N-terminus of the 
proteases readily binds to the nickel column. This produces a powerful 
affinity chromatographic technique for rapidly purifying the soluble 
protease. The column chromatography was performed in a batch mode. 
The Ni2+ NTA resin (3ml) was washed twice with 50 ml of Buffer A ( 
50mM sodium phosphate pH 7.8 containing 10% glycerol, 02% Tweert- 
20, lOmM BME). The lysate obtained from a 250 ml fermentation (12.5 
ml) was incubated with the resin for one hour at 4°C. The flow through 
was collected by centrifugation. The resin was packed into a 1.0 x 4 cm 
column and washed with buffer A until the baseline was reached. The 
bound protein was then eluted with a 20 ml gradient of imidazole (0- 
0.5M) in buffer A. Eluted fractions were evaluated by SDS-PAGE and 
western blot analysis using a rabbit polyclonal antibody to His-HIV 183. 



PORO? 



In an alternative method to purify the proteins the lysate containing the 
proteins were applied to a POROS metal-chelate affinity column. 
Perfusion chromatography was performed on a POROS MC metal 
chelate column (4.6 x 50mm, 1.7 ml) precharged with Ni*-. The sample 
was applied at 10 ml/min and the column was washed with buffer A. 
The column was step eluted with ten column volumes of buffer A 
containing 25 mM imidazole. The column was further eluted with a 25 
column volume gradient of 25-250 mM imidazole in buffer A. All 
eluted fractions were evaluated by SDS-PAGE and western blot analysis 
using rabbit polyclonal antibody. 

Example 5 



Peptide Synthesis of thr> 5A/5B and 4B /5A Substrates 
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The peptides 5A/5B and 4B/5A substrates (SEQ ID NOs 16, 18, 19, 20 and 
21) were synthesized using Fmoc chemistry on an ABI model 431 A 
peptide synthesizer. The manufacture recommended FastMoc™ 
5 activation strategy (HBTU/HOBt) was used for the synthesis of 4A 

activator peptide. A more powerful activator, HATU with or without 
the additive HOAt were employed to assemble 5A/5B substrate peptides 
on a preloaded Wang resin. The peptides were cleaved off the resin and 
deprotected by standard TFA cleavage protocol. The peptides were 
1 0 purified on reverse phase HPLC and confirmed by mass spectrometry 
analysis. 

Example 6 

15 HPLC-assav using a sy nthetic 5A/5B peptide substrate 

To test the proteolytic activity of the HCV NS3 protease the 
DTEDWCC SMSYTWTGK (SEQ ID NO 16) and soluble HCV NS3 (SEQ 
ID NO 27) were placed together in an assay buffer. The assay buffer was 

20 50mM sodium phosphate pH 7.8, containing 15% glycerol, lOmM DTT, 
0.2% Tween20 and 200 mM NaCl). The protease activity of SEQ ID NO 
27 cleaved the substrate into two byproduct peptides, namely 5A and 5B. 
The substrate and two byproduct peptides were separated on a reversed- 
phase HPLC column. (Dynamax, 4.6 x 250 mm) with a pore size of 300A 

25 and a particle size of 5^m. The column was equilibrated with 0.1%TFA 
(Solvent A) at a flow rate of 1 ml per minute. The substrate and the 
product peptide standards were applied to the column equilibrated in A. 
Elution was performed with a acetonitrile gradient (Solvent B=100% 
acetonitrile in A). Two gradients were used for elution (5% to 70%B in 

30 50 minutes followed by 70% to 100%B in 10 minutes). 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 
5 (i) APPLICANT: Schering Coip. 

(ii) TITLE OF INVENTION: Synthetic Inhibitors of Hepatitis C Virus 
NS3 Protease 

1 0 (iii) NUMBER OF SEQUENCES: 30 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Schering Corp. 

(B) STREET: 2000 Galloping Hill Road 
1 5 (C) CITY: Kenilworth 

(D) STATE: New Jersey 

(E) COUNTRY: USA 

(F) ZIP: 07033-0530 

20 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Apple Macintosh 

(C) OPERATING SYSTEM: Macintosh 7.1 
P) SOFTWARE: Microsoft Word 5.1a 

25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

30 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/644,544 

(B) FILING DATE: 10 May 1996 

35 (viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Dulak, Norman C. 

(B) REGISTRATION NUMBER. 31,608 
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(C) REFERENCE / DOCKET NUMBER: JB0595 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 908-298-5061 
5 (B) TELEFAX: 908-298-5388 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
1 0 (A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

1 5 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 
(A) NAME/KEY: 

20 Glu Asp Val Val Cys Cys Acp Acp Cys Val Val He Val Gly Arg 

5 10 15 

He Val Leu Ser Gly Lys 

20 

25 (2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
30 (C) STRANDEDNESS: 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: peptide 



35 (ix) FEATURE: 

(A) NAME/KEY: 
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Glu Asp Val Val Cys Cys Acp Cys Val Val He Val Gly Arg He 

5 10 15 

Val Leu Ser Gly Lys Lys 

20 

5 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 
0 (B) TYPE: amino acid 

(C) STRANDEDNESS: 
(D) TOPOLOGY: 



(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 
(A) NAME /KEY: 



Glu Asp Val Val Cys Cys Acp Lys Lys Gly Ser Leu Val He Arg 

5 10 15 

Gly-Val-lle-Val-Val-cys 

20 



(2) INFORMATION FOR SEQ ID NCfc4.- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
(D) TOPOLOGY: 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: 

(B) OTHER INFORMATION: Xaa is lysine having a peptide bond 
between its e-amino group and the carboxyl group of lysine at position 8. 
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The carboxyl group of the Xaa forms a peptide bond with the a-amino 
group of another lysine (not shown); 



Glu Asp Val Val Cys Cys Xaa 
5 5 
Val lie Val Val Cys 

20 



Lys Gly Ser Leu Val He Arg Gly 
10 15 



(2) INFORMATION FOR SEQ ID NO:5: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
1 5 (D) TOPOLOGY: 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 
20 (A) NAME /KEY: 

(B) OTHER INFORMATION: Amino acid residues at positions 9-21 
are preferably D-amino acid residues; 

Glu Asp Val Val Cys Cys Acp Acp Lys Gly Ser Leu Val lie Arg 

5 10 15 

25 Gly Val He Val Val Cys 

20 



(2) INFORMATION FOR SEQ ID NO:6: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
(D) TOPOLOGY: 

35 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 



10 
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(A) NAME/KEY: 

(B) OTHER INFORMATION: The lysine residue at position 8 has a 
peptide bond between the carboxyl group of Acp and the a amino group 
of the lysine, and the e amino group of the lysine at position 8 forms a 
peptide bond with the carboxyl group of the cysteine residue at position 
9 and the amino acid residues at positions 9-21 are preferably D-amino 
acid residues; 

Glu Asp Val Val Cys Cys Acp Lys Cys Val Val lie Val Gly Arg 

lie Val Leu Ser Gly Lys 

20 

(2) INFORMATION FOR SEQ ID NO:7: 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
20 (D) TOPOLOGY: 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 
25 (A) NAME /KEY: 

(B) OTHER INFORMATION: Amino acids at positions 8-20 are 
preferably D-amino acids. 



30 



35 



Glu Asp Val Val Cys Cys Acp Lys Gly Ser Leu Val He Arg Gly 

5 10 15 

Val lie Val Val Cys Lys 

20 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: 
(D) TOPOLOGY: 

(ii) MOLECULE TYPE: peptide 

5 

(ix) FEATURE: 

(A) NAME /KEY: 

(B) OTHER INFORMATION: 

Xaa is a lysine wherein the e amino group of which forms a peptide 
1 0 bond with the carboxyl group of the cysteine residue at position 8 and 
the carboxyl group of the lysine residue forms a peptide bond with an a 
amino group of another lysine residue (not shown), preferably the 
amino acid residues at positions 8-20 are D- amino acid residues. 

15 Glu Asp Val Val Cys Cys Xaa Cys Val Val lie Val Gly Arg He 

5 10 15 

Val Leu Ser Gly Lys 

20 

20 (2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
25 (C) STRANDEDNESS: 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 
(A) NAME /KEY: 

(B) OTHER INFORMATION: The amino acid residues at positions 
1- 13 are preferably D-amino acid residues and lysine at position 14 is 
preferably an L-amino acid residue; 



30 



35 



Lys Gly Ser Leu Val He Arg Gly Val He Val Val Cys Lys 

5 10 
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(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS: 
(D) TOPOLOGY: 

(ii) MOLECULE TYPE: peptide 

10 

(be) FEATURE: 

(A) NAME /KEY: 

(B) OTHER INFORMATION: Amino acid residues at positions 1 - 
11 are preferably D-amino acids; 

15 

Lys Gly Ser Leu Val He Arg Gly Val He Val Lys 

5 10 

INFORMATION FOR SEQ ID NO:l 1 : 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
25 (D) TOPOLOGY: 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 
30 (A) NAME/KEY: 

(B) OTHER INFORMATION: The amino acid residues are 
preferably D-amino acid residues. 

Cys Val Val lie Val Gly Arg He Val Leu Ser Gly 
35 5 10 

INFORMATION FOR SEQ ID NO:12: 



WO 97/43310 



PCTAJS97/07632 



-33- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
(D) TOPOLOGY: 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 
(A) NAME/KEY: 

(B) OTHER INFORMATION: The amino acid residues are 
preferably D-amino acids and the serine residue at position 1 is 
preferably acetylated; 

Ser-Leu-Val-Ile-Arg-Gly-Val-Ile-Val 

5 

INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
(D) TOPOLOGY: 

(ii) MOLECULE TYPE: peptide 

# 

(ix) FEATURE: 
(A) NAME/KEY: 

(B) OTHER INFORMATION: The amino acid residues are 
preferably D-amino acid residues and the lysine residue at position 1 is 
preferably acetylated. 

Lys Gly Ser Leu Val lie Arg Gly Val lie Val Val Cys 

5 io 

INFORMATION FOR SEQ ID NO:14: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
5 (D) TOPOLOGY: 



(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 
(A) NAME/KEY: 

(B) OTHER INFORMATION: Xaa is biotin and the amino acid 
residues at positions 2 - 14 are preferably D-amino acids; 

Xaa Lys Gly Ser Leu Val lie Arg Gly Val lie Val Val Cys Lys 

5 io 

Lys 

INFORMATION FOR SEQ ID NO:15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
(D) TOPOLOGY: 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) OTHER INFORMATION: Xaa is a lysine residue in which the e 
amino group of the lysine forms a peptide bond with a biotin and amino 
acid residues at positions 1 - 13 are preferably D-amino acid residues. 

Lys Gly Ser Leu Val lie Arg Gly Val lie Val Val Cys Xaa Lys 

5 10 15 

(2) INFORMATION FOR SEQ ID NO:16: 



WO 97/43310 



PCT/US97/07632 



-35- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: HCV NS3 Protease 



GCG CCC ATC ACG GCG TAC GCC CAG CAG ACG AGA GGC CTC CTA GGG 45 
Ala Pro lie Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
15 10 15 

TGT ATA ATC ACC AGC CTG ACT GGC CGG GAC AAA AAC CAA GTG GAG 90 
Cys lie lie Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu 

20 25 30 



GGT GAG GTC CAG ATC GTG TCA ACT GCT ACC CAA ACC TTC CTG GCA 135 
Gly Glu Val Gin He Val Ser Thr Ala Thr Gin Thr Phe Leu Ala 

35 40 45 



ACG TGC ATC AAT 
Thr Cys He Asn 

ACG AGG ACC ATC 
Thr Arg Thr He 

ACC AAT GTG GAC 
Thr Asn Val Asp 

TCC CGC TCA TTG 
Ser Arg Ser Leu 



GGG GTA TGC TGG 
Gly Val Cys Trp 
50 

GCA TCA CCC AAG 
Ala Ser Pro Lys 
65 

CAA GAC CTT GTG 
Gin Asp Leu Val 
80 

ACA CCC TGC ACC 
Thr Pro Cys Thr 



ACT GTC TAC CAC 
Thr Val Tyr His 
55 

GGT CCT GTC ATC 
Gly Pro Val He 
70 

GGC TGG CCC GCT 
Gly Trp Pro Ala 
85 

TGC GGC TCC TCG 
Cys Gly Ser Ser 



GGG GCC GGA 180 
Gly Ala Gly 
60 

CAG ATG TAT 225 
Gin Met Tyr 
75 

CCT CAA GGT 270 
Pro Gin Gly 
90 

GAC CTT TAC 315 
Asp Leu Tyr 
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CTG GTT ACG AGG CAC 
Leu Val Thr Arg His 
5 no 



-36- 

100 

GCC GAC GTC ATT CCC 
Ala Asp Val lie Pro 

115 



105 

GTG CGC CGG CGA GGT 360 
Val Arg Arg Arg Gly 

120 



GAT AGC AGG GGT AGC 
Asp Ser Arg Gly Ser 

125 

0 

AAA GGC TCC TCG GGG 
Lys Gly Ser Ser Gly 

140 

5 GTG GGC CTA TTC AGG 
Val Gly Leu Phe Arg 

155 

GCG GTG GAC TTT ATC 
Ala Val Asp Phe He 

170 



CTG CTT TCG CCC CGG CCC 
Leu Leu Ser Pro Arg Pro 

130 

GGT CCG CTG TTG TGC CCC 
Gly Pro Leu Leu Cys Pro 

145 

GCC GCG GTG TGC ACC CGT 
Ala Ala Val Cys Thr Arg 

160 

CCT GTG GAG AAC CTA GAG 
Pro Val Glu Asn Leu Glu 

175 



ATT TCC TAC CTA 405 
lie Ser Tyr Leu 

135 

GCG GGA CAC GCC 450 
Ala Gly His Ala 

150 

GGA GTG ACC AAG 495 
Gly Val Thr Lys 

165 

ACA ACC ATG AGA 540 
Thr Thr Met Arg 

180 



TCC CCG GTG 
Ser Pro Val 

(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDN A 



(ix) FEATURE: 

(A) NAME/KEY: NS4A 
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AGC ACC TGG GTG CTC GTT GGC GGC 
Ser Thr Trp Val Leu Val Gly Gly 

1 5 



37- 

GTC CTG GCT GCT CTG GCC GCG 45 
Val Leu Ala Ala Leu Ala Ala 
10 15 



TAT TGC CTG TCA ACA GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC 90 
Tyr Cys Leu Ser Thr Gly Cys Val Val lie Val Gly Arg He Val 

20 25 30 



TTG TCC GGG AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC 135 
10 Leu Ser Gly Lys Pro Ala He He Pro Asp Arg Glu Val Leu Tyr 

35 40 45 



15 



CAG GAG TTC GAT GAG ATG GAA GAG TGC 162 
Gin Glu Phe Asp Glu Met Glu Glu Cys 

50 



(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: double 

25 (ii) MOLECULE TYPE: cDNA 



GA TCA CCG GTC TAG ATCT 
T GGC CAG ATC TAGA 

30 (2) INFORMATION FOR SEQ ID NO:19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

35 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 
(A) NAME/KEY: 

5 CCG GTC CGG AAG AAA AAG AGA CGC TAG C 
AG GCC TTC TTT TTC TCT GCG ATC G 

(2) INFORMATION FOR SEQ ID NO:20: 

0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
(A) NAME /KEY: 

CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC CAG GAA TTC 
GT TAA TAT GGA CTG TCC CTC CAA GAG ATG GTC CTT AAG 

GAT GAG ATG GAA GAG TGC CGG AAG AAA AAG AGA CGC A 
CTA CTC TAC CTT CTC ACG GCC TTC TTT TTC TCT GCG TTC GA 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME /KEY: NS4A Active Mutant 



WO 97/43310 



PCT/US97/07632 



-39- 

Cys Val Val lie Val Gly Arg He Val Leu Ser Gly Lys 

5 10 

(2) INFORMATION FOR SEQ ID NO:22: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
1 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 
15 (A) NAME /KEY: Soluble 5A/5B Substrate 

Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Thr Trp Thr 

5 10 15 

Gly Lys 

20 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 810 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

30 

(ix) FEATURE: 

(A) NAME /KEY: pNB182A4AHT 



35 ATG AGA GGA TCG CAT CAC CAT CAC CAT CAC ACG GAT CCG CCC ATC 
Met Arg Gly Ser His His His His His His Thr Asp Pro Pro He 
15 10 15 



45 
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ACG GCG TAC GCC 
Thr Ala Tyr Ala 

ACC AGC CTG ACT 
Thr Ser Leu Thr 

CAG ATC GTG TCA 
Gin lie Val Ser 



CAG CAG ACG AGA 
Gin Gin Thr Arg 
20 

GGC CGG GAC AAA 
Gly Arg Asp Lys 
35 

ACT GCT ACC CAA 
Thr Ala Thr Gin 
50 



40- 

GGC CTC CTA GGG 
Gly Leu Leu Gly 
25 

AAC CAA GTG GAG 
Asn Gin Val Glu 
40 

ACC TTC CTG GCA 
Thr Phe Leu Ala 
55 



TGT ATA ATC 90 
Cys He He 
30 

GGT GAG GTC 135 
Gly Glu Val 
45 

ACG TGC ATC 180 
Thr Cys He 
60 



AAT GGG GTA TGC 
Asn Gly Val Cys 

ATC GCA TCA CCC 
He Ala Ser Pro 



TGG ACT GTC TAC 
Trp Thr Val Tyr 
65 

AAG GGT CCT GTC 
Lys Gly Pro Val 
80 



CAC GGG GCC GGA 
His Gly Ala Gly 
70 

ATC CAG ATG TAT 
He Gin Met Tyr 
85 



ACG AGG ACC 225 
Thr Arg Thr 

75 

ACC AAT GTG 270 
Thr Asn Val 
90 



GAC CAA GAC CTT GTG GGC TGG CCC GCT CCT CAA GGT TCC CGC TCA 315 
Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly Ser Arg Ser 

95 100 105 



TTG ACA CCC TGC ACC TGC GGC TCC TCG GAC CTT TAC CTG GTT ACG 360 
Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr 

HO 115 120 



AGG CAC GCC GAC GTC ATT CCC GTG CGC CGG CGA GGT GAT AGC AGG 
Arg His Ala Asp Val He Pro Val Arg Arg Arg Gly Asp Ser Arg 

125 130 135 



405 
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GGT AGC CTG CTT TCG CCC CGG CCC ATT TCC TAC CTA AAA GGC TCC 450 
Gly Ser Leu Leu Ser Pro Arg Pro lie Ser Tyr Leu Lys Gly Ser 

140 145 150 

TCG GGG GGT CCG CTG TTG TGC CCC GCG GGA CAC GCC GTG GGC CTA 495 
Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu 

155 160 165 

TTC AGG GCC GCG GTG TGC ACC CGT GGA GTG ACC AAG GCG GTG GAC 540 
Phe Arg Ala Ala Val Cys Thr Arg Gly Val Thr Lys Ala Val Asp 

170 175 180 

TTT ATC CCT GTG GAG AAC CTA GAG AC A ACC ATG AGA TCC CCG GGG 585 
Phe lie Pro Val Glu Asn Leu Glu Thr Thr Met Arg Ser Pro Gly 

185 190 195 

GTG CTC GTT GGC GGC GTC CTG GCT GCT CTG GCC GCG TAT TGC CTG 630 
Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu 

200 205 210 

TCA ACA GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC TTG TCC GGG 720 
Ser Thr Gly Cys Val Val He Val Gly Arg He Val Leu Ser Gly 

215 220 225 

AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC CAG GAG TTC 765 
Lys Pro Ala He He Pro Asp Arg Glu Val Leu Tyr Gin Glu Phe 

230 235 240 

GAT GAG ATG GAA GAG TGC CGG AAG AAA AAG * AGA CGC AAG CTT AAT 810 
Asp Glu Met Glu Glu Cys Arg Lys Lys Lys Arg Arg Lys Leu Asn 

245 250 255 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

5 (ix) FEATURE: 

(A) NAME/ KEY: Native NS4A 

TCA ACA TGG GTG CTC GTT GGC GGC GTC CTG GCT GCT CTG GCC GCG 45 
Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala 

TAT TGC CTG TCA ACA GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC 90 
Tyr Cys Leu Ser Thr Gly Cys Val Val lie Val Gly\Arg He Val 

20 25 30 



5 



TTG TCC GGG AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC 135 
Leu Ser Gly Lys Pro Ala He He Pro Asp Arg Glu Val Leu Tyr 

35 40 45 



CAG GAG TTC GAT GAG ATG GAA GAG TGC 
Gin Glu Phe Asp Glu Met Glu Glu Cys 

50 

2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME/KEY: Native 5A/5B Substrate 
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Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Thr Trp Thr 

5 10 15 

Gly 

5 2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

1 0 (C) STRANDEDNESS: 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: polypeptide 

1 5 (ix) FEATURE: 

(A) NAME /KEY: NS3/NS4A Cleavage site 

Cys Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu 

5 10 15 

20 Val Gly Gly Val Leu 

20 

2) INFORMATION FOR SEQ ID NO:27: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: polypeptide 



(ix) FEATURE: 

(A) NAME/KEY: NS4A/4B Cleavage Site 

35 Tyr Gin Glu Phe Asp Glu Met Glu Glu Cys Ser Gin His Leu Pro 

5 10 15 

Tyr lie Glu Gin Gly 

20 
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2) INFORMATION FOR SEQ ID NO.-28: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

1 0 (ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME /KEY: 4B/5A 

15 Tr P Ile Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu 

5 10 15 

Arg Asp Ile Trp Asp 

20 

20 2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

30 (ix) FEATURE: 

(A) NAME/KEY: 

Glu-Asp-Val-Val-Cys-Cys-Acp-Lys 

5 

35 

2) INFORMATION FOR SEQ ID NO:30: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 
(A) NAME /KEY: 



Cys-Val-Val-Ile-Val-Gly-Arg-Ile-Val-Leu-Ser-Gly-Lys 

5 10 



WO 97/43310 
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WE CLAIM: 

1. A bivalent inhibitor of an hepatitis C NS3 protease comprised of a 
5 first peptide linked to a second peptide, said first peptide being a 

subsequence, a mutated subsequence or a mutated full-length sequence 
of a substrate of the hepatitis C NS3 protease and said second peptide 
being a subsequence of a hepatitis C NS4A polypeptide. 

10 2. The bivalent inhibitor of claim 1 selected from the group consisting 
of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, 
SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, and SEQ ID NO: 8. 

3. An inhibitor of an HCV protease comprised of a peptide, said peptide 
1 5 being a subsequence, a mutated subsequence or a mutated full-length 

sequence of a substrate of the HCV NS3 protease. 

4. An inhibitor of claim 3 selected from the group consisting of SEQ ID 
NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, 

20 SEQ ID NO: 14 and SEQ ID NO: 15. 

5. An inhibitor of an HCV NS3 protease comprised of a peptide, said 
peptide being a subsequence, a mutated subsequence or a mutated full- 
length sequence of an HCV NS4A polypeptide. 

25 

6. The use of an inhibitor of an HCV NS3 protease for the manufacture 
of a medicament for treating hepatitis C, wherein the inhibitor is 
comprised of a first peptide linked to a second peptide, said first peptide 
being a subsequence, mutated subsequence or a mutated full-length 

30 sequence of a substrate of the hepatitis C NS3 protease and said second 
peptide being a subsequence, a mutated subsequence or a mutated full- 
length sequence of a hepatitis C NS4A polypeptide. 

7. The use of claim 6 wherein the inhibitor is selected from the group 
35 consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, 

SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, and SEQ ID NO: 8. 
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8. The use of an inhibitor of an HCV NS3 protease for the manufacture 
of a medicament for treating hepatitis C, wherein the inhibitor is 
comprised of a peptide, said peptide being a subsequence, a mutated 
subsequence or a mutated full-length sequence of a substrate of the HCV 

5 NS3 protease. 

9. The use of claim 8 wherein the inhibitor is selected from the group 
consisting of SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 
12, SEQ ID NO: 13, SEQ ID NO: 14 and SEQ ID NO: 15. 

10 

10. The use of an inhibitor of an HCV NS3 protease for the 
manufacture of a medicament for treating hepatitis C, wherein the 
inhibitor is comprised of a peptide, said peptide being a subsequence, a 
mutated subsequence or a mutated full-length subsequence of an HCV 

15 NS4A polypeptide. 

11. A pharmaceutical composition for treating an individual infected 
with hepatitis C virus, said pharmaceutical composition being an 
inhibitor of an HCV NS3 protease, said inhibitor being comprised of a 

20 first peptide linked to a second peptide, said first peptide being a 

subsequence, a mutated subsequence or a mutated full-length sequence 
of a substrate of the hepatitis C NS3 protease and said second peptide 
being a subsequence, a mutated subsequence or a mutated full-length 
sequence of a hepatitis C NS4A polypeptide, and a pharmaceutical 

25 carrier. 

12. The pharmaceutical composition of claim 11 wherein the inhibitor 

is selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ 
ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, and 
30 SEQ ID NO: 8. 

13. A pharmaceutical composition for treating an individual infected 
with hepatitis C virus, said pharmaceutical composition being 
comprised of an inhibitor of an HCV NS3 protease and a pharmaceutical 

35 carrier, said inhibitor being a subsequence, a mutated subsequence or a 
mutated full-length sequence of a substrate of the HCV NS3 protease. 
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14. The pharmaceutical composition of claim 13 wherein the inhibitor is 
selected from the group consisting of SEQ ID NO: 9, SEQ ID NO: 10, SEQ 
ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14 and SEQ ID 
NO: 15. 

15. A pharmaceutical composition for treating an individual infected 
with hepatitis C virus, said pharmaceutical composition being 
comprised of an inhibitor of an HCV NS3 protease and a pharmaceutical 
carrier, wherein said inhibitor is comprised of a peptide, said peptide 
being a subsequence, a mutated subsequence or a mutated full-length 
subsequence of an HCV NS4A polypeptide. 
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